Empirical Exploration of Novel Architectures and Objectives for Language Models
نویسندگان
چکیده
While recurrent neural network language models based on Long Short Term Memory (LSTM) have shown good gains in many automatic speech recognition tasks, Convolutional Neural Network (CNN) language models are relatively new and have not been studied in-depth. In this paper we present an empirical comparison of LSTM and CNN language models on English broadcast news and various conversational telephone speech transcription tasks. We also present a new type of CNN language model that leverages dilated causal convolution to efficiently exploit long range history. We propose a novel criterion for training language models that combines word and class prediction in a multi-task learning framework. We apply this criterion to train word and character based LSTM language models and CNN language models and show that it improves performance. Our results also show that CNN and LSTM language models are complementary and can be combined to obtain further gains.
منابع مشابه
Estimation of Global Solar Irradiance Using a Novel combination of Ant Colony Optimization and Empirical Models
In this paper, a novel approach for the estimation of global solar irradiance is proposed based on a combination of empirical correlation and ant colony optimization. Empirical correlation has been used to estimate monthly average of daily global solar irradiance on a horizontal surface. The Ant Colony Optimization (ACO) algorithm has been applied as a swarm-intelligence technique to tune the c...
متن کاملA Novel Intelligent Water Drops Optimization Approach for Estimating Global Solar Radiation
Normal 0 false false false EN-US X-NONE AR-SA MicrosoftInternetExplorer4 Measurement of solar radiance demands expensive devices to be used. Alternatively, estimator models are used instead. In this paper, a new method based on the empirical equations is introduced to estimate the monthly average daily global solar radiation on a horizontal surface. The proposed method uses Intelligent Water ...
متن کاملNumerical analysis of reactant transport in the novel tubular polymer electrolyte membrane fuel cells
In present work, numerical analysis of three novel PEM fuel cells with tubular geometry was conducted. Tree different cross section was considered for PEM, namely: circular, square and triangular. Similar boundary and operational conditions is applied for all the geometries. At first, the obtained polarization curve for basic architecture fuel cells was validated with experimental data and then...
متن کاملImproving the performance of financial forecasting using different combination architectures of ARIMA and ANN models
Despite several individual forecasting models that have been proposed in the literature, accurate forecasting is yet one of the major challenging problems facing decision makers in various fields, especially financial markets. This is the main reason that numerous researchers have been devoted to develop strategies to improve forecasting accuracy. One of the most well established and widely use...
متن کاملDesign of a novel congestion-aware communication mechanism for wireless NoC architecture in multicore systems
Hybrid Wireless Network-on-Chip (WNoC) architecture is emerged as a scalable communication structure to mitigate the deficits of traditional NOC architecture for the future Multi-core systems. The hybrid WNoC architecture provides energy efficient, high data rate and flexible communications for NoC architectures. In these architectures, each wireless router is shared by a set of processing core...
متن کامل